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ABSTRACT 


This  report  desci*ibes  progress  toward  an  "intelligent  question¬ 
answering  system" — a  system  that  can  accept  facts,  retrieve  items  from 
memory,  and  perform  logicul  deductions  necessary  to  answer  questions. 
Two  versions  of  such  a  system  have  beem  implemented,  and  the  authors 
expect  these  to  be  the  first  in  an  evolving  series  of  question 
answerers . 

The  first  system,  QA1,  is  based  upon  relational  information 
organized  in  a  list -structured  memory.  The  data  consist  of  general 
facts  about  relations  as  well  as  specific  facts  about  objects.  QA1 
has  limited  deductive  ability. 

QA2  is  based  upon  formal  theorem-proving  techniques.  Pacts  are 
represented  by  statements  in  the  predicate  calculus.  Although  the 
memory  organization  is  simpler  than  that  of  QA1,  the  sophisticated 
logical  abilities  of  QA2  result  in  greater  question-answering  power. 

The  report  gives  examples  of  the  performance  of  QA1  and  QA2  on 
typical  problems  that  have  been  done  by  previous  question- answerers , 
and  describes  plans  for  extending  the  capabilities  of  QA2. 
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I  INTRODUCTION 


At  Goal s 

This  report  describes  progress  during  the  first  year  of  n  continuing 
project  whose  primary  goal  is  the  development  of  an  intelligent  question- 
ftnsworing  system,  The  point  of  view  of  this  report  is  that  a  question- 
answering  system  is  well-dofinod  formal  system,  exhibited  in  tha  form 
Of  a  computer  program,  that,  has  at  least  the  following  tlU'ee  character¬ 
istics; 

(1)  The  ability  to  accept  statements  of  faot,  possibly 
expressed  in  a  formal  language  such  as  a  predicate 
calculus,  and  storo  them  in  its  memory, 

(2)  The  ability  to  sonrch  stored  information  efficiently 
and  to  reoogni/.e  items  that  nre  relevant  to  a  partic¬ 
ular  query. 

(3)  The  ability  to  respond  appropriately  to  a  question  by 
identifying  and  presenting  the  answer  if  it  is  present 
in  memory,  and  by  deducing  a  reasonable  logical  re¬ 
sponse  from  relevant  knowledge  If  the  complete  answer 
is  not  explicitly  avail  able. 

Our  present  research  is  designed  to  study  basic  problems  involved 
in  making  a  quest  ion- answering  system  i ntoil Igcnt ,  without  reference  to 
particular  potential  applications.  If  such  a  system  can  ho  made  prac¬ 
tical,  one  can  think  of  many  ultimato  applications — such  us  a  flexible 
command  and  control  information  service,  a  reference  librarian's  assis¬ 
tant,  or  a  robot  on  Mars  that  must  be  able  to  make  the  best  autonomous 
decisions  based  upon  limited  information. 

I).  Previous  Work 

Early  work  in  the  general  area  of  quest ion- answering  systems  is 

44. 

competently  surveyed  by  Simmons.1  Additional  and  more  recent  work  lias 


He fere roes  are  listed  at  the  end  of  the  report. 
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boon  reported  by  sovornl  groups  including  Colby  and  K«on,a  McCarthy,9 
Quill  1un,<  .Simmons , *  single,0,  Thompson,7  Craig  etnl.,*  and 
Wclzctkbnum.  0 

Sonic  oi'  the  previous  work,  Including  that,  of  Weir-unbaum  and  Colby 
uhil  Kneu,  has  been  primarily  concerned  with  problems  related  to  the  use 
of  natural  English  as  an  input  language.  McCarthy,  among  others,  has 
published  sovorul  pnpei  a  with  the  iuUrcsUitg  question  of  liow 

<o  formiil  i  7,q  everyday  knowledge.  Quill  Ian  has  Impl  onion  led  u  fully  cross- 
referenced  dictionary,  which  might  be  viewed  us  a  useful  tool  for  (ruthur 
than  an  example  of)  u  question-answering  system,  Simmons,  Raphael , 10 
Thompson,  and  Craig  e_t  al.  seem  to  bo  converging  toward  the  viewpoint 
that  some  sort  of  list  structure  representation  of  relational  data  is 
necessary  in  n  quest  ion-answering  system  as  a  general  semantic  model. 
McCarthy,  iu  his  "advice  taker"  discussions, 11  and  Slagle  and  others  have 
considered  ways  of  applying  formal  logical  techniques  to  question  answer- 
tag,  Row  of  the  previous  projects  have  realistically  considered  the 
problem  of  extracting  and  utilizing  only  the  relevant  duta  from  a  poten¬ 
tially  enormous  store  of  facts. 

Our  current  work  is  based  largely  upon  both  the  semantic  representa¬ 
tions  used  by  Raphael19  and  the  formal  theorom-proving  techniques  devolopod 
by  Robinson19  and  others.  Wo  expoet  to  demonstrate  the  feasibility  of 
quest: ion-answering  systems  that  use  both  list-structure  semantic  models 
and  formal  theorem-proving  techniques  to  store  facts,  extract  relevant 
data,  and  deduce  logical  answers  to  questions, 

C.  Methodology  and  Emphasis 

Our  original  plan  for  this  project  was  to  design,  implement,  experi¬ 
ment  with,  and  evaluate  an  evolving  series  of  versions  of  u  question- 
answering  system.  Each  version  would  be  designed  to  overcome  the  major 
shortcomings  of  its  predecessors,  .and  in  turn  the  process  of  working  with 
each  would  be  expected  to  -  invf  <>?•;  additional  features  to  he  included  in 
1  ts  successors. 


As  Is  woll  known,  the  process  of  designing,  implementing,  und  ex¬ 
perimenting  with  n  largo  computer  program  can  bo  extremal y  tedious  and 
time-oonaumlng.  However,  with  the  invaluable  aid  of  tlu»  LISP  program¬ 
ming  language  in  the  Interactive  environment  of  tho  <}~33  time  sharing 
system,  we  liuvo  succeeded  in  carrying  out  tho  plan  through  two  versions 
of  the  sys?tom--QAl  and  QA2, 

After  surveying  the  literature,  we  decided  that  a  valuable  feature 
missing  from  most  existing  programs  was  a  data  representation  that  could 
contain  general  facts  about  the  relations  in  tho  logical  system  us  woll 
ns  specific  facts  about  tho  objects  In  the  real  problem  domain.  QA1  re¬ 
sulted  from  our  search  for  such  a  memory  organisation.  A  modified  predi¬ 
cate  calculus  was  solccted  to  represent  information  from  a  few  of  tho 
subject  areas  that  had  been  usod  by  previous  question-answering  systems. 
Statements  In  this  iormul  language  wore  placed  on  property  lists  of 
various  key  words.  Facts  woro  rotriovod  by  searching  tlioso  property 
lists,  and  logical  conclusions  were  deduced  by  using  modus  ponona  and 
substitution  as  rulos  of  inference.  The  focus  of  our  work  during  this 
period  was  on  techniques  of  memory  organization. 

Although  QA1  achieved  somo  limited  question-answering  ability,  its 
power  was  limited  because  its  logical  dodaotlve  s ye tom  was  incomplete. 
Therefore  wo  transferred  our  attention  to  the  problem  of  providing  bettor 
formal  proof  procedures  to  opera tu  upon  i he  information  available  in  tho 
ques t ion- answering  system. 

QA2  is  a  quest ion-nnsworing  system  derived  almost  exclusively  from 
formal  theorom-proving  ideas.  It  is  based  upon  the  .1.  A,  Hobiuson  tech¬ 
nique  of  resolution,  und  uses  the  unit  preference  strategy* *  and  other 
heuristics  to  improve  search  efficiency. 

Section  II  of  this  report  discusses  QA1  in  detail.  Section  III 
describes  formal  theorem-proving  techniques,  including  resolution,  and 
their  applicability  to  question  answering.  QA2  is  discussed  in  Sec.  IV. 
Section  V  describes  somo  plans  for  future  systems.  Further  details  of 
the  two  implemented  systems  appear  in  the  appendices. 
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II  QA1--A  QUESTION  answerer  based  on  list-structured  memory 


QA1  was  tho  first  system  Ui|>l  ensouled  under  this  project.  It  was 
based  lurgolv  on  tho  SIR  system  of  Raphael..0  •  10  The  major  advantage  of 
QA1  over  SIR  ilea  in  the  ability  of  QAi  to  hold  In  Ms  list  structure 
memory  logical  suit omenta  about  how  various  kinds  of  Tacts  might  interact. 
Tima  QA1  does  not  require  aa  many  soparato  ad  hoc  (most ion-  msworing 
10 u tinea  as  did  SIR. 

QAI  is  the  first  of  what  wo  expect  to  bu  u  sorios  of  successively 
inoro  powerful  (most ion  answarors;  it  was  novor  intended  to  bo  a  complete 
poll  shod  system.  Thus  sample  problem  domains  wore  borrowed  from  pre¬ 
vious  quest lon-anawor lug  programs,  and  the  control  language  and  logicnl 
deduction  programs  of  q.Al  woro  loft  in  rathor  rough  form.  In  this  sec¬ 
tion  we  describe  the  ubilltios  and  organization  of  QAI,  Further  details 
and  oxamplos  are  given  in  Appondix  A.  Of  course,  many  of  tho  capabilities 
or  QA.l  ere  improved  in  QA2;  theso  aro  described  la  tor  in  this  report . 

A.  Abilities  of  the  System 

In  each  of  several  subjoct.  areas,  QA1  is  capable  of  accepting  facts 
and  then  answering  questions  based  on  those  facts.  The  following  are 
categories  and  examples  of  questions  that  the  system  is  capable  oi  an¬ 
swering,  after  tho  given  facts  have  been  staled  in  an  appropriate  input 
language i 

( 1 )  Pnrt-whoio  rolnt lonahlps 

(a)  Facts:  Every  porson  has  two  arms,  and  every  arm 
has  one  hand,  and  every  hand  has  five  fingers. 

(b)  Questions  answered  by  the  system;  Does  overy 
hand  have  five  lingers?  How  many  fingers  does  a 
person  have? 

(2)  Set-membership 

(a)  Facts :  John  is  an  instance  of  a  person. 

(b)  Quos t.lons  answered;  Is  John  a  person?  John  is 

an  instance  of  what?  Who  is  an  instance  of  a  person? 


•t 


Set- inclusion 


(3) 

(a)  Facts ;  Every  boy  is  a  person. 

(b)  Questions:  Is  every  boy  a  person?  If  John  is  a 
boy,  who  is  an  instance  of  a  person? 

(4)  Spatial- relationships 

(a)  Facts:  The  telephone  is  on  top  of  the  table. 

(b)  Question  to  be  answered:  What  is  on  top  of  the 
table?  (if  any  telephone  can  lie  used  for  com¬ 
munication) ,  what  is  on  top  of  the  table  that  can 
be  used  for  communication? 

(5)  Miscellaneous 

(a)  Facts :  Scott  is  an  airfield  having  an  aspaalt 
runway  which  is  6000  feet  long.  Scott,  Clark, 
and  Dover  are  all  airfields. 

(b)  Questions  to  be  answered:  What  is  Clark?  Does 
Scott  have  a  runway?  Does  Scott  have  a  runway 
length  of  2000  feet? 

B.  Data  Representation 

For  a  discussion  of  some  of  the  reasons  for  choosing  this  represen¬ 
tation,  see  Sec.  V-A.  The  data  consist  of  relational  statements  similar 
to  statements  of  first-order  predicate  calculus.  They  are  represented 
by  list  structures  organized  for  efficient  information  retrieval.  (See 
Kef.  22  for  a  discussion  of  list  structures  and  property  lists.)  Binary 
relations  are  stored  in  memory  as  attribute- value  pairs  on  the  property 
lists  of  the  symbols  that  are  related.  Statements  of  greater  complexity 
than  binary  relations  are  also  stored  on  these  property  lists  under 
special  attributes.  A  sample  of  ti.e  data-base  may  be  represented  as 
follows : 

JOHN  -*  TYPE  (proper  name),  INST  (John,  Boy) 

PERSON  -•  TYPE  (generic  name),  INCLUDE  (Boy,  Person), 
HAS-PART-SET  (person,  arm,  2) 

TELEPHONE  -  TYPE  (generic  name),  INST  (telephone-1, 
telephone),  USE  (telephone,  communication) 

USE  —  TYPE  (relation),  arguments  (2) 
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INST  TYPE  (relation),  COMPUTES  (INST(ST  X  RULE(FRKEVARS (U  V  W) 
PUED( IMPLIES ( AND ( INCLUDE  V  W)(lNST  U  V))(lNST  U  W))))) 

where  the  word  at  the  tail  of  an  arrow  is  an  elementary  symbol.  The  list 
at  the  head  of  an  arrow  is  the  property  list  of  that  symbol,  INST  is  the 
name  of  the  set-membership  relation  and  INCLUDE  is  the  name  of  the  set- 
inclusion  relation. 

These  statements  represent  the  information  that:  "JOHN"  is  the  name 
of  an  object,  belonging  to  the  set  of  Boys;  "PERSON"  is  the  name  of  a  set 
that  includes  the  set  of  Boys;  every  "PERSON"  has  as  parts  a  set  of  two 
arms;  '^Telephone-1"  is  a  member  of  the  set  of  all  telephones;  all  telephones 
are  used  for  communication;  "USE"  is  a  binary  relation;  "INST"  is  a  re¬ 
lation;  and  the  following  axiom  holds;  (y  u)(y  v)(v  w)[vC  w  A  uev  =£>  u£w]. 

C.  Query  Language 

To  interrogate  the  system,  the  user  types  questions  in  one  of  two 
formats,  depending  upon  whether  the  question  is  to  be  answered  by  the 
FINDR  function  or  the  VFIND  function. 

1.  FINDR 

The  function  FINDR  takes  as  arguments  the  name  of  a  relation  and 
arguments  for  that  relation.  Each  argument  for  a  relation,  called  a 
tern; ,  may  be  completely  specified  by  name,  partially  specified  by  a  de¬ 
scriptive  operator,  or  unspecified  (a  "don't-care''  term).  FINDR  will 
attempt  to  find  objects  that  satisfy  the  specifications  for  each  term 
and  for  which  the  relation  holds.  If  it  succeeds,  it  will  return  the 
relation  with  the  names  of  the  appropriate  objects  filled  in.  Although 
FINDR  has  a  limited  degree  of  logical  inference  ability  embedded  in  its 
specif ication-matching  facility,  it  is  primarily  a  memory-searching 
function.  It  also  knows  about  and  deals  with  the  abbreviations  used  in 
the  memoiy. 

Some  examples  of  its  operation  are  given  below.  In  each  case  the 
machine's  output  is  given  on  the  line  following  the  input  (which  starts 
with  FINDR). 
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FINDK ( INST  (TELEPHONE 1  TELEPHONE ) ) 
(INST  (TELEPHONE!  TELEPHONE)) 


Here  all  terms  are  objects  specified  by  name.  The  statement  is 
found  crue  and  FINDU  returns  the  input  unchanged. 

FINDB(INST(DCE  TELEPHONE)) 

(INST  (TELEPH0NE1  TELEPHONE)) 

"DCE"  stands  for  a  don't-care  expression — i.e.,  any  term  that  satis¬ 
fies  the  above  relations. 

FINDR(ONTOP( (ST  X  USE(X  COMMUNICATION) )TABLEl)) 

(ONTOP  (TELEPHONE 1  TABLEl) ) 

The  first  argument  of  the  relation  ONTOP  is  partially  specified  by 
using  the  descriptive  operator  ST  ("such-that") .  The  term  (ST  X  USE(X  COM¬ 
MUNICATION))  means  "the  object  X  such  that  X  is  used  for  communication." 
Some  logical  inference  was  necessary  to  answer  this  question.  The  three 
facts , 

(a)  TELEPHONE!  is  a  telephone, 

(b)  Every  telephone  is  used  for  communication, 

(c)  TELEPHONE 1  is  on  top  of  TABLEl, 

were  used  in  the  course  of  answering  this  question, 

2.  VFIND 

The  function  VFIND  represents  our  first  attempt  at  a  procedure  for 
applying  special  axioms  found  in  the  data.  The  arguments  of  VFIND  are 
a  list  of  variables  and  a  list  of  predicates  containing  those  variables. 

The  program  will  then  attempt  to  find  values  for  the  variables  that  will 
satisfy  all  of  the  predicates.  If  FINDK  fails  to  find  the  answer  by 
searching  memory,  then  VFIND  searches  memory  for  a  relevant  axiom  and 
then  tries  to  deduce  the  answer.  If  FINDU  succeeds,  VFIND  returns  a 
list  of  dotted  pairs  in  which  each  variable  is  paired  with  the  value  that 
satisfies  the  predicates.  The  deductive  procedure  is  described  in  the 
next  section  (d).  Some  examples  of  the  use  of  VFIND  are  given  below. 
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VFIND ( (X) ( INST  JOHN  X) ) 

((X  .  BOY)) 

VFIND( (X) ( INST  X  PERSON)) 

((X  .  JOHN)) 

vfind( (y) (haspahtset  human  finger  y)) 

((Y  .  10)) 

This  last  result  may  Be  interpreted  precisely  to  mean  that  every  member 
of  the  set  of  humans  lias  as  parts  a  set  of  10  elements  from  the  set  of 
fingers.  Each  of  the  above  questions  required  several  step-deductions 
and  the  use  of  special  axioms  found  i  i  memory. 

D.  Deduction  Routines 

If  the  answer  to  a  given  question  was  not  found  in  memory,  the 
program  VFIND  carried  out  the  following  procedure:  Let  us  suppose  the 
input  is 

VFIND ( (X) ( INST  X  PERSON)), 

meaning  "find  some  element  in  the  set  of  persons."  Also,  suppose  that  no 
information  of  the  form 

(INST  MIKE  PERSON) 

is  in  memory.  FINDll  will  thus  fail  to  find  the  answer  in  the  first 
search.  It  will  then  search  for  an  axiom  and  find,  on  the  property  list 
of  INST,  the  rule 

(FREEVARS  (U  V  W)  PRED  (IMPLIES  (AND(lNCLUDE  V  W))  (INST  U  V))  ( INST  U  W) ) ) 

The  consequent,  (INST  U  W)  is  "matched"  to  (INST  X  PERSON)  and  since  the 
match  succeeds,  then  two  subproblems  consisting  of  the  two  predicates  in 
the  antecedent 

(INCLUDE  V  PEUSON)  and 
(INST  JOHN  V) 

are  generated.  Two  predicates  that  match  these  requirements, 

(INCLUDE  BOY  PERSON)  and 
(INST  JOHN  BOY) 
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arc  then  found  in  memory.  The  program  keeps  track  of  variables  and 
returns  the  value 

((X  .  JOHN)) 

The  process  is  recursive  and  at  any  point  in  a  subproblem  it  may  call  for 
special  axioms  to  employ.  In  QA1,  all  axioms  are  in  the  form  of  an  im¬ 
plication,  so  no  additional  rules  of  inference  are  used. 

E.  Evaluation  and  Limitations 

The  system  was  fast,  taking  only  a  few  seconds  of  real  time  for  the 
most  difficult  questions  that  it  was  capable  of  answering.  Exact  machine 
times  are  unknown,  since  the  Q-32  is  a  time-sharing  system  normally  having 
20  to  25  users,  Once  the  program  was  operative,  the  ability  to  deal  with 
new  subject  areas  could  be  added  in  a  few  minutes  by  merely  typing  in  the 
necessary  relations  and  axioms.  Thus  no  reprogramming  would  be  necessary 
to  handle  new  subjects  and  new  questions,  a  problem  that  plagued  Kaphoel's 
SIR  project. Also,  new  and  old  subjects  could  be  Interactive.  The 
program  could  employ  set-membership  information  in  solving  spatial- 
relationship  problems. 

However,  we  felt  that  several  changes  should  be  made  to  the  program. 
The  data  representation  and  memory  organization  were  adequate  but  the  de¬ 
duction  techniques  required  improvement.  The  program  handled  existential 
quantifiers  only  in  certain  cases  and  recognized  only  two  logical  con¬ 
nectives,  AND  and  IMPLIES.  The  functions  VF1ND  and  KINDR  were  not  quite 
compatible,  and,  as  a  result,  the  rule  of  inference 

(Vx)P(x)  ->  P(a) 

could  not  be  applied  in  some  cases.  The  program  had  no  sophisticated 
means  of  preventing  loops  or  picking  the  order  in  which  to  attempt  de¬ 
ductions.  It  tried  a  simple  depth-first  search  of  the  proof  tree  generated 
by  the  deductive  routine  described  in  D  above.  As  a  result  of  these 
limitations,  QA1  could  not  answer  such  questions  as  "How  many  hands  does 
John  have?"  and  "Does  there  exist  a  person  who  is  not  a  boy?"  The  for¬ 
malization  of  these  two  problems  and  their  solutions  by  QA2  are  given  in 
Sec.  IV-D  below. 


To  progress  further,  two  alternatives  were: 

(1)  To  modify  the  current  program  by  correcting  each  defi¬ 
ciency  one  at  a  time  and  experimentally  evolve  more 
sophisticated  deductive  routines,  perhaps  similar  to 
those  of  Fischer  Black's  question-answering  program. 15 

(2)  To  base  our  new  work  upon  relevant  research  in  the 
field  of  automatic  theorem- proving . 

Like  our  questi on-answering  programs,  automatic  theorem- proving 
programs  must  be  logically  complete  and  must  contain  heuristics  for 
selecting  subproblems— i .e. ,  for  searching  "proof  trees"  efficiently. 

To  our  knowledge,  however,  theorem- provers  have  not  been  used  in  a  system 
containing  information-retrieval  capabilities.  It  was  not  clear  just 
how  a  mathematical  theorom-prover  could  l>e  used. 

We  selected  the  second  alternative— adaptation  of  results  in  auto¬ 
matic  theorem  proving— because  of  its  potential  power  to  provide  us  even¬ 
tually  with  a  very  general,  yet  conceptually  simple,  question-answering 
system.  Thus  work  on  QA1  was  abandoned  and  we  proceeded  to  study  how 
theorem- proving  techniques  could  best  be  utilized,  and  then  to  im¬ 
plement  QA2. 


Ill  THEOREM -PROVING  AND  QUEST  I  ON -ANSWER  INC. 


One  of  the  most  important  characteristics  of  n  question-answering 
system  is  its  logical  deductive  ability.  A  system  that,  can  derive  and 
construct  responses  from  its  stored  knowledge  is  far  more  interesting 
than  a  system  that  can  only  parrot  back  responses  that  are  stored  explicitly 
in  its  memory. 

Mathematicians  and  philosophers  have  studied  the  nature  of  implication 
and  deduction,  primarily  in  the  abstract  domain  of  formal  logic.  Most  of 
the  formal  logical  systems  that  have  been  studied  contain  all  the  rea¬ 
sonable  properties  one  might  wish  in  "deducing"  facts  in  a  particular, 
informal  subject  domain,  and  most  formal  systems  can  be  easily  applied, 
with  appropriate  semantic  models,  to  the  particular  subject  domains  of 
interest.  Therefore,  we  decided  that,  instead  of  developing  our  own 
heuristic  deductive  techniques,  we  should  try  to  apply  the  most  powerful 
logical  procedures  available  in  the  mathematics  literature  to  our 
question-answering  problems, 

A.  Introduction  to  Formal  Theorem -Proving  Techniques 

Formal  logic  usually  deals  with  well-defined  strings  of  symbols 
called  "well-formed  formulas"  (wff's),  and  with  a  subset  of  the  wff's 
called  "theorems."  Each  wff  can  be  interpreted  as  a  statement,  that 
may  lie  true  or  false,  about  the  state  of  a  particular  semantic  model. 

The  semantic  domain  may  consist  of  any  individuals  and  relations;  in 
the  absonce  of  specific  semantic  knowledge,  a  domain  consisting  of 
numbers  and  sets  is  frequently  used  as  the  "standard  interpretation," 

A  model  is  said  to  satisfy  a  wff  if  the  statement  represented  by 
the  wff  is  true  for  that  model.  A  wff  that  is  satisfied  by  all  possible 
models  (from  the  semantic  domain)  is  called  valid. 
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Hie  theorems  of  a  logical  system  are  usually  intended  to  bo  the 
valid  wff's.  However,  since  it  is  not  practical  in  general  to  enumerate 
and  test  nil  possible  modols,  formal  syntactic  procedures  called  proof 
procedures  must  be  used  to  establish  theorems.  If  every  theorem  of  a 
proof  procedure  is  indeed  valid,  th<_  procedure  is  colled  sound .  If 
every  valid  formula  can  be  demonstrated  to  be  a  theorem,  the  procedure 
is  complete.  In  the  desirable  case  that  a  proof  procedure  is  both  sound 
and  comploto,  tho  thooroms  of  the  procedure  coincide  with  the  valid  wff's 
A  decision  procedure  is  a  sound  and  complete  proof  procedure  that  can 
effectively  decide  whether  any  given  wff  is  valid  or  not. 

Unfortunately,  a  famous  theorem  by  Oodel  shows  that  any  sufficiently 
rich  and  consistent  formal  system  is  incomplete!  that  is,  there  will 
always  exl9t  wff's  that  are  valid  but  cannot  be  formally  proved  to  be 
valid.  Tills  means  that,  for  the  interesting  formal  systems,  there  can 
be  no  decision  procedure;  we  must  content  ourselves  with  sound  proof 
procedures  that  can  establish  ns  theorems  some,  but  not  all,  of  tho 
valid  wff's. 

As  a  practical  matter,  however,  the  incompleteness  property  is  much 
less  restrictive  than  it  may  at  first  appear.  Because  of  the  time  and 
space  constraints  on  practical  computation,  tho  heuristic  power  of  a 
proof  procedure - -i. e. ,  its  ability  to  prove  useful  theorems  cfficicntly- 
is  more  important  than  its  ultimate  cl lucH veness  on  all  theorems,  A 
decision  procedure  that  requires  enormous  amounts  of  time  or  intermediate 
storage  is  undistinguishable,  in  practice,  from  an  incomplete  proof 
procedure  that  never  terminates  for  some  wff's. 

In  recent  years,  much  work  has  been  done  on  the  development  on  proof 
procedures  suitable  for  implementation  on  a  digital  computor.1* 

The  most  effective  of  these  seem  to  bo  those  that  use  the  Robinson 
resolution  principle  in  conjunction  with  the  HerbrandlT  approach  to 
theorem  proving,  sometimes  called  "semantic  tableau"  methods. 


I),  Tho  Robinson  Procedure  for  Proof  by  Resolution 


The  basic  approach  of  llcrbrand  proof  procedures  Is  to  attempt  to 
construct  a  model  that  satisfies  tho  negation  of  the  wl'f  to  bo  proved. 

Since  every  wff  is  either  true  or  false  for  each  possible  model,  every 
model  must  either  satisfy  a  given  wff  or  else  satisfy  its  negation,  and 
no  modol  can  simultaneously  satisfy  both  a  wff  and  Its  negation.  If  a 
wff  is  valid,  its  negation  cannot  be  satlsflable.  If  the  construction 
of  a  model  for  the  negation  of  a  wff  is  completed,  thon  the  wff  is  not  a 
theorem.  If  the  construction  process  leads  to  obviously  contradictory 
model  assignments,  then  no  satisfying  model  is  possible  and  tho  wff  is 
proved  to  be  valid.  If  the  construction  process  procoods  interminably, 
the  proof  procedure  fails.  (This  is  why  those  proof  procedures  are  not 
decision  procedures.  It  is  known  that  no  decision  procedure  can  exist 
for  the  first-ordor  predicate  calculus.) 

Proof  by  resolution  is  a  ilerbrnnd  typo  of  procedure.  The  negation 
of  the  wff  to  be  proved  is  first  placed  into  n  ^candnrd  form  ( prenex 
conjunctive  normal  form,  in  which  existentially  quantified  variables  are 
replaced  by  Skolcm  functions  of  previous  universally  quantified  variables). 
In  this  form,  the  wff  is  represented  as  the  conjunction  of  a  set  of 
formulas  called  clauses ,  each  of  which  is  a  disjunction  of  elementary 
formulas  called  literals.  Then  new  clausos  ore  deduced  from  the  starting 
clauses  by  the  inference  rule  of  resolution,  such  that  the  original  wff 
is  satlsflable  only  if  its  doscendont  clauses  ore  all  satlsflable.  The 
goal  of  tho  procedure  is  to  deduce  the  empty  formula,  which  is  not 
sacisfiable  and  therefore  demonstrates  that  all  its  antecedents,  including 
the  starting  wff,  are  not  satlsflable. 

Tho  rule  of  resolution  is  best  illustrated  first  in  its  propositional 
form:  if  p v  or  and  -~pv  8  are  two  wff’s  in  which  p  is  any  proposition  and 
a  and  8  arc  any  wfi's,  one  may  deduce  the  wff  cry  8. 

The  predicate  calculus  form  of  the  resolution  rule  is  this:  Jx?t  1,^ 
be  any  atomic  formula — l.e.,  a  wff  consisting  of  a  single  predicate 
symbol  followed  by  an  appropriate  set  of  constant,  variable,  and  function 
symbols  for  arguments.  Let  be  the  negation  of  an  atomic  formula 


consisting  of  the  somo  predicate  symbol  os  L  ,  but  in  genorol  with 
different  Arguments.  Lot  o  and  8  bo  any  wff's  in  the  predicate  calculus. 
Lot  (a)  l>c  the  wff  obtained  from  cv  by  making  oil  substitutions  specified 
by  the  substitution  sot  o,  of  formulas  for  free  Occurrences  of  variables 
in  o.  If  there  exists  any  sot  of  substitutions  v  or  variables  in  1. 
nnd  L„  that  makes  L,.  identical  to  the  negation  of  L, ,  then  from  the 
two  wff's  L  v  a  nnd  l  v  8  we  may  deduce  the  "resolvent"  (o  vB) 

1  w  (Jj 


Example : 


P(x,f(y))  V  Q(x)  V  K(f(r),v) 


nnd 


~p(  f(  1(a)),  St)  V  R(  ,t,w) 


imply,  by  resolution, 

Q(f(  *(«)))  V  R(f(n),y)  v  R(f(y),w) 

whore  the  substitution  set  is  o  =  (f(f(a))  for  x,  f(y)  for  z}. 

The  main  theorem  of  resolution  btotos  that  if  a  resolvent  is  not 
satisfiable  then  noithor  of  its  antecedents  are  satisfiable,  and  that 
the  empty  formula  is  not  sutisfrnble. 

The  resolution  rule  to  1 1  •>  us  how  to  <lerive  a  new  clause  from  a 
specified  pair  of  clauses  containing  a  specified  litoral,  but  does  not 
toll  us  how  to  choose  which  clauses  to  resolve,  A  mechanical  attempt  to 
resolve  all  possible  pairs  of  clauses  generally  rosults  in  the  generation 
of  an  unmanageably  largo  number  of  irrelevant  clauses.  Therefore, 
various  heuristic  search  principles  are  being  developed  to  guide  nnd 
control  the  selection  of  clauses  for  resolution.  Among  the  most  important 
of  these  are  tho  set  of  support,14  unit  preference,  level  hound,  and 
subsumption  strategies. 1:1 

The  statement  of  o  theorem  to  bo  proved  usually  consists  of  a  set 
of  premises  (  axioms )  and  a  conclusion.  The  set  of  support  strategy 
consists  of  designating  the  conclusion,  and  perhaps  a  small  number  ol 
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the  most  rolovnnt  axioms,  os  ''having  tho  T-support  property" — i.o.,  lying 
in  tho  sot  of  support  for  tho  theorem.  Thereafter  only  those  pairs  of 
clausos  containing  ft t  least  one  member  with  T-support  nro  considered 
for  resolution,  and  every  resolvent  is  automatically  attributed  t-lto 
T-support  property.  This  strategy  is  Mimed  nt  avoiding  the*  deduction 
of  consequences  for  some  of  the  promises  that  are  independent  of  (and 
irrelevant  to)  the  particular  conclusion  desired. 

The  unit  preference  strategy  essentially  orders  the  clauses  to  bo 
resolved  by  their  length — i,o.,  by  tho  number  of  literals  they  contain. 
Contradictions  become  apparent  only  when  two  unit  ( ouc-litoral)  clauses 
resolve  together  to  produce  tho  empty  clauso,  Therefore,  one  might  hope 
to  discover  a  contradiction  in  the  least  tlmo  by  worning  first  with  the 
shortest  clauses. 

Occasionally  the  unit  preference  strategy  may  euuse  ono  to  generate 
and  resolve  lengthy,  perhaps  endless,  sequonoes  of  unit,  clauses,  to  the 
neglect  of  longer  but  perhaps  more  fruitful  douses.  This  difficulty 
can  bo  overcome  by  placing  a  bound  on  computation  that  will  determine 
when  the  unit  preference  strategy  should  be  abandoned  in  favor  of  a 
broador  search.  One  such  bound  sots  a  maximum  on  tho  number  of  levels-- 
i.e.,  intermediate  steps,  between  a  deduced  clauso  and  tho  original 
theorem. 

in  the  course  of  resolution  proof,  sovornl  douses  may  bo  introduced 
that  carry  equivalent  information  and  therefore  lead  to  distracting, 
extraneous  stops,  In  particular,  if  C  is  any  clause  ,  and  if  =  (C)c 
is  obtainable  os  on  instance  of  C  by  some  substitution  set  c,  and  if 
clauso  D  -  C  V  o  whore  o-  is  any  formula,  then  C  subsumes  D  in  the  sense 
that  tho  set  of  clauses  (C,D)  is  sotisiiablc  if  and  only  if  C  alone  is 
satisfioble.  Therefore,  wo  should  delete  from  our  proof  any  clauso  that 
is  subsumed  by  another  clauso  in  the  proof. 

The  proof  procedure  implemented  as  part  of  QA2  is  a  resolution 
procedure  using  some  form  of  each  of  tho  nbovo  search  strategies. 
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C.  Utilizing  u  Thcorcm-Provor 

How  may  this  theorem-proving  procedure  ho  usotl  in  n  question -answering 
system?  Wo  plan  to  uso  it  in  several  ways ,  including  the  following 
throe: 

{ l)  Answer  true-false  questions.  To  find  out  if  a  given 

input  sontoncc  Is  true  or  falsG  the  theorem -provor  will 
attempt  first;  to  prove  that  tho  sentence  is  true.  If, 
nftor  a  certain  expenditure  of  effort,  no  proof  is 
found,  the  thoorom-provor  could  then  attempt  to  prove 
the  sentence  false. 

(2)  Find  nn  object  satisfying  certain  conditions.  A  ques¬ 
tion  limy  be  stated  as,  "Find  x  such  that  l>(x)  is  true," 
whore  P(x)  is  some  specified  predicate.  This  problem  may 
be  posed  to  a  theorem  provor  ns  tho  statement  (Ux)(Px). 

If  this  statement  is  proved,  then  the  answer  to  the  ques¬ 
tion  is  tho  term  that  is  substituted  for  x  during  the 
course  of  the  prui>*\  this  torn  m«v  be  n  variable  (signify¬ 
ing  that  l’(x )  is  true  for  all  x),  a  constant,  or  a  function. 

If  tho  clause  representing  P(x)  is  used  several  times  in 
the  proof,  then  tho  answer  is  tho  disjunction  of  the  several 
terms  substituted  for  x.  This  answer  may  then  be  represented 
Internally  as  a  ciauso,  e.g.,  (P{n)  v  P(b)  v  P(x)  v  P(F(y))} 
so  that  the  theorem  provor  may  then  simplify  it  by  romoving 
unnecessary  disjuncts, 

(3)  Ciivo  tho  steps  necessary  to  roach  somo  goal,  ir 
permissible  actions  are  suitably  nxiomatizod,  the 
stops  in  a  proof  that  the  goal  is  nchlovable 
correspond  to  the  sequence  of  actions  that  must  l>© 
taken  to  achieve  it.  Examples  of  such  axioniotlza- 
tions  and  proofs  are  given  by  McCarthy,*  Since 
these  nxiomatlzatlous  usually  involve  second-order 
predicate  calculus  (e,g.,  statements  about  all 
predicates  that  can  cause  certain  situations  to 
occur),  the  Robinson  procedure  is  not  directly 
applicable,  QA'J  cannot  handle  this  typo  of 
problem,  although  we  expect  to  extend  it  so 

that  It  will  have  this  capability. 

Section  IV-D  contain.-:  annotated  examples  of  program  QA2  solving 
several  typical  problems. 


lc 


IV  QA2— an  extended  theorem- prover  as  a  question-answering  system 


This  section  contains  a  general  description  of  QA2,  and  examples  of 
its  operation.  Detailed  definition  of  some  of  the  theorem- prover' s  al¬ 
gorithms  appear  in  Appendix  B  of  this  report.  Listings  of  the  complete 
QA2  programs,  written  in  LISP,  are  available  upon  request. 

A.  General  Organization 


In  a  system  having  a  large  number  of  statements  or  facts  in  its 
memory,  a  key  problem  is  that  of  which  statement  to  use  next  in  solving 
a  problem  (or  proving  a  theorem).  This  problem  is  sometimes  stated  as 
the  problem  of  finding  which  statements  are  "relevant"  to  the  problem 
at  hand. 

A  simple  and  logically  complete  solution  to  this  problem  is  given 
by  an  extension  of  the  set-of- support  strategy:  for  this  approach  every 
"fact"  is  stored  in  the  form  of  a  clause  suitable  for  use  by  the  theorem 
prover. 

(1)  First,  give  the  theorem-prover  only  the  clauses  repre¬ 
senting  the  negation  of  the  sentence  to  be  proved.  All 
clauses  representing  this  negated  sentence  are  given 
T-support.  (Note  that  a  theorem  of  the  predicated  cal¬ 
culus — e.g.,  (yx)[p(x)v  ~  P(x)j--may  be  provable  without  refer¬ 
ence  to  facts  in  memory.) 

(2)  If  no  proof  is  found,  the  theorem-prover  then  addresses 
memory  for  a  limited  number  of  additional  clauses  that 
will  resolve  with  clauses  in  the  theorem-prover  having 
T-support.  (Suitable  memory  organization  and  use  of 
the  subsumption  test  can  be  used  \o  increase  the  effi¬ 
ciency  of  the  search. ) 

(3)  If  no  proof  is  found  witli  the  new  clauses  then  return  to 
Step  2. 

As  in  other  theorem-proving  programs,  heuristics  such  as  a  bound  on 
level  or  computing  time  must  be  used  to  insure  practical  run  times  for 
the  program. 
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This  simple  measure  of  "relevance"  of  one  clause  to  another  is 
whether  or  not  the  clauses  will  resolve.  Note  that  this  process  is 
complete  in  the  sense  that  if  a  proof  exists  (within  the  limitation  on 
time  and  space)  it  will  be  found.  The  process  is  efficient  in  the  sense 
that  some  clauses  that  cannot  lead  to  a  proof  are  never  used,  QA2,  the 
program  described  below,  contains  an  implementation  of  the  above  algo¬ 
rithm.  Although  this  technique  has  the  advantage  of  relative  efficiency 
over  giving  all  the  clauses  in  memory  to  the  theorem-prover ,  several  im¬ 
provements  to  this  technique  will  be  necessary  before  we  produce  a  truly 
practical  system. 

B.  QA2--Control  Language 

The  quest ion- answering  program  for  QA2  consists  of  a  collection  of 
functions  or  subprograms  which  perform  the  various  tasks  necessary  for 
such  a  system.  At  the  top  level  an  executive  program  EXEC  allows  for 
user-machine  interaction  by  accepting  input  in  its  "command  language," 
calling  upon  the  appropriate  function  to  perform  the  desired  operation, 
and  responding  to  the  teletype  user.  At  present,  the  language  accepts 
three  types  of  input:  statements,  questions,  and  commands. 

1.  Statements 

A  statement  is  entered  in  the  following  format: 

S  expression 

where  the  letter  S  signifies  that  the  following  "expression"  is  to  be 
added  to  the  system's  data  base. 

Tlie  expression  is  a  predicate  calculus  statement  such  as 
(IN  JOHN  BOY)  or 

((FA  (X  Y  Z)  (IF  (AND  (IN  X  Y)  (INCLUDE  Y  Z))  (IN  X  Z))) 

The  first  states  that  John  is  a  boy,  or  more  precisely,  that  John  is  an 
element  of  the  set  named  Boy. 

The  second  is  equivalent  to  the  predicate  calculus  statement: 

(Vx)  (Yy)  (Vz)  [  xey  A  yc  z  =>  xez  ] 


! 
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When  a  statement  is  encountered,  it  is  transformed  into  prenex  conjunctive 
normal  form  and  then  filed  in  the  memory  of  the  question-answering  system. 

2.  Questions 


A  question  is  entered  in  a  similar  fashion: 

Q  question 

where  Q  signifies  that  the  predicate  calculus  expression  that  follows  is 
to  be  treated  as  a  quest'.or.  to  the  system.  Here,  the  negation  of  the 
question  is  put  into  conjunctive  normal  form  and  passed  on  to  a  subexecu¬ 
tive  program  EXEC1  which  attempts  to  answer  the  question  based  on  the 
current  information  in  the  data  base.  (Part  D  of  this  section  shows  how 
various  questions  may  be  posed  as  predicate  calculus  expressions. ) 

3.  Commands 

A  series  of  additional  commands  have  been  implemented  which  allow 
the  user  to  interrogate  and  alter  the  system: 

(a)  UNWIND 

After  a  question  has  been  successfully  answered,  the 
UNWIND  command  will  print  the  proof  of  the  answer  given 
to  the  question. 

(b)  CONTINUE 

If  the  system  was  unsuccessful  in  answering  a  question, 
the  CONTINUE  command  will  cause  the  system  to  continue 
searching  for  proof  with  the  level  bound  raised. 

(c)  LIST 

Tlie  command  LIST  PR  where  PR  is  a  predicate  symbol 
will  list  all  of  the  statements  in  the  data  base  that 
contain  the  symbol  PR. 

(d)  FORGET 

Tlie  command  FORGET  PR  S  will  delete  certain  statements 
that  contain  the  predicate  letter  PR  according  to  the 
format  of  S — e.g.  ,  if  S  is  an  integer  n,  the  ntl1  state¬ 
ment  will  be  deleted. 

(e)  FILE 

FILE  F  asks  the  theorem  prover  to  operate  on  a  prepared 
list  F  of  clauses. 
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c 


The  Theorem  Prover 


The  theorem  prover  FNI  accepts  as  input  a  list  of  clauses  CLAUSELIST 
and  a  level  bound  MAXLEV.  Its  goal  is  to  determine,  if  possible,  that  the 
set  of  clauses  is  unsatisf iable ,  or  equivalently,  to  derive  tho  null  clause 
(containing  no  literals)  which  is  the  result  of  two  contradictory  clauses. 

The  algorithm  used  is  the  unit  preference  strategy  with  T-support. 

In  addition,  it  traces  the  values  assigned  to  variables  that  were  originally 
bound  by  existential  quantifiers.  Thus  if  the  theorem  prover  completes  a 
proof  of  a  statement  of  the  form  (ax)  P(x),  the  question  answerer  can 
exhibit  the  x  that  satisfies  P.  This  is  extremely  useful,  as  the  examples 
in  part  1)  below  will  show. 

The  operation  of  the  theorem  prover  starts  by  ordering  the  clauses 
on  CLAUSELIST  by  the  number  of  literals  in  each  clause.  Hie  program  suc¬ 
cessively  attempts  to  produce  resolvents  from  the  clauses  in  CLAUSELIST, 
producing  first  those  resolvents  of  shortest  length.  To  avoid  redundant 
computation  as  much  as  possible,  resolvents  of  two  clauses  Cl  and  C2  are 
produced  only  if  the  following  criteria  are  satisfied: 

(1)  Either  Cl  or  C2  (or  both)  must  have  T-support. 

(2)  The  level  i  of  any  resolvent  of  Cl  and  C2  plus  the 
length  of  the  resolvent  must  not  be  greater  than  the 
level  bound  MAXLEV.  (This  is  a  modification  of  the 
usual  level  bound  strategy. ) 

(3)  Neither  Cl  nor  C2  has  been  subsumed  by  any  other 
clause  in  the  proof. 

Furthermore,  if  a  resolvent  R  of  Cl  and  C2  is  produced,  it  is  added 
to  CLAUSELIST  only  if  K  is  not  a  tautology — i.e. ,  does  not  contain  comple¬ 
mentary  literals--and  if  R  is  not  subsumed  by  any  clause  already  on  the 
list. 


D.  Examples  of  QA2  in  Operation 

A  sample  dialogue  with  QA2  is  given  below.  The  input  and  output  from 
the  computer  are  printed  in  all  capital  letters.  After  some  of  the  ex¬ 
changes,  we  have  added  an  explanation. 
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S  (IN  JOHN  BOY) 

OK 

Ihe  statement  (indicated  by  "S")  that  John  is  contained  in  the  set 
of  boys  is  accepted  and  the  response  is  "OK," 

Q  (IN  JOHN  BOY) 

YES 

The  question  (indicated  by  "q"),  "is  John  in  the  set  of  hoys?"  is 
answered  "Yes."  This  is  an  example  of  a  simple  yes  or  "no  proof  found" 
answer. 


Q  (EX  (X)  (IN  JOHN  X)) 
YES  WHEN  X  =  BOY 


Does  there  exist  an  x  such  that  John  is  in  the  set  x?  Note  that 
the  program  reports  what  assignment  is  made  to  x  to  complete  its  proof. 

S  (FA  (X)  (IF  (IN  X  BOY)  (IN  X  PERSON))) 

OK 

This  says  that  every  boy  is  a  person,  or  (Vx)txeBOY  =>  xePERSONl 

Q  (EX  (X)  (IN  X  PERSON)) 

YES  WHEN  X  =  JOHN 


Does  there  exist  a  member  of  the  set  of  humans?  The  theorem  prover 
must  have  used  two  statements:  John  is  a  boy,  and  every  boy  is  a  person. 


UNWIND 

SUMMARY 

1  IN(JOHN.BOY)  AXIOM 

2  ~IN(X, PERSON)  NEG  OF  THM 

3  -in(x,boy)  in(x, person)  axiom 

4  -IN(X,BOY)  FROM  2,3 

(CONTRADICTION  FROM  CLAUSES  1  AND  4) 

(5  CLAUSES  GENERATED) 
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The  command  NWIND  caused  the  proof  to  be  printed  oi  t.  Each  numbered 

line  corresponds  to  one  clause.  A  clause  may  come  from  three  sources: 

AXIOM  -  retrieved  from  memory 

NEC  OF  THM  -  the  negation  of  the  question 

FROM  N,M  -  tlie  result  of  resolving  together 
clauses  N  and  M. 

The  number  of  clauses  generated  represents  the  size  of  the  proof  tree 
upon  generating  the  empty  clause;  this  is  a  measure  of  the  amount  of  ef¬ 
fort  Involved  In  completing  the  proof. 

S  (FA  (X)  (IF  (IN  X  PERSON)  (IN  X  HUMAN))) 

OK 

It  unquestionlngly  believes  that  all  persons  are  human. 

q  (EX  (X)  (IN  X  HUMAN)) 

YES  WHEN  X  =  JOHN 

S  (FA  (X)  (IF  (IN  X  HUMAN)  (HP  X  ARM  2))) 

OK 

Q  (HP  JOHN  ARM  2) 

YES 

(HP  JOHN  ARM  2)  means  that  John  Has- ao- Parts  two  elements  of  the 
set  of  all  arms. 

S  (FA  (Y)  (IF  (IN  Y  ARM)  (HP  Y  HAND  1))) 

OK 

Q  (EX  (X)  (HP  JOHN  HAND  X)) 

NO  PROOF  POUND 

The  crucial  axiom,  given  next,  was  missing. 

S  (FA  (X  Y  Z  M  N)  (IF  (AND  (HP  X  Y  M) 

(FA  (U)  (IF  (IN  U  Y)  (HP  U  Z  N))))  (HP  X  2S  (TIMES  M  N)))) 

OK 

Q  (EX  (N)  (HP  JOHN  HAND  N)) 

YES  WHEN  N  =  TIMES  (2,1) 

TIMES  (2,1)  represents  the  product  of  2  and  I  (=2), 
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UNWIND 


SUMMARY 


1  IN( JOHN, HOY) 

AXIOM 

2  - HP (JOHN, HAND, N) 

3  IN(SK8(N,M,Z,Y,X),Y)  -  1IP(X,Y,M) 

hp(x,z,times(m,n) ) 

NEG  OF  TIIM 

AXIOM 

4  -HPf JOHN, Y,M)  IN(SK8(N, M, HAND, Y, JOHN), Y) 

5  -IN(Y.ARM)  HP(Y,HAND,1) 

FROM  2,3 

AXIOM 

6  -HP (JOHN, ARM, M)  HI>(SK8(N,M, HAND, ARM, JOHN) .HAND, l) 

7  -HP(SK8(N,M,Z,Y,X),Z,N)  -HP(X,Y,M) 

FROM  4,5 

IIP  (X,Z,  TIMES  (M,N)) 

AXIOM 

8  -IIP(JOHN,Y,M)  -HP(SK8(N, M, HAND, Y, JOHN), HAND, N) 

FROM  2,7 

9  -HP  ( JOHN , ARM , M ) 

FROM  6,8 

10  -IN(X, HUMAN)  HP (X, ARM, 2) 

AXIOM 

11  -IN(JOHN, HUMAN) 

FROM  9,10 

12  -IN(X, PERSON)  IN(X, HUMAN) 

AXIOM 

13  -IN(JOHN, PERSON) 

FROM  11,12 

14  -in(x,boy)  in(x, person) 

AXIOM 

15  -IN(J0HN,U0Y) 

FROM  13,14 

(CONTRADICTION  FROM  CLAUSES  1  AND  15) 

27  CLAUSES  GENERATED) 

This  required  a  8-step  proof.  SK8  Is  the  nnme  generated  by  the  program 
for  a  Skolem  function  used  to  eliminate  an  existential  quantifier. 

S  (OR  (AT  JOHN  HOME)  (AT  JOHN  SRI)  (AT  JOHN  AIRPORT)) 

OK 

Q  (EX  (X)  (AT  JOHN  X)) 

YES  WHEN  X  -  SRI 

OR  WHEN  X  =  AIRPORT 
OR  WHEN  X  =  HOME 

Note  that  the  output  may  be  a  logical  "OR"  of  several  possibilities. 


S  (FA 
OK 

S  (FA 
OK 

S  (FA 
OK 

S  (FA 
OK 


(X)  (EQUALS  X  X)) 

(X  Y)  (IF  (EQUALS  X  Y)  (EQUALS  Y  X))) 

(X  Y  Z)  (IF  (AND  (EQUALS  X  Y)  (EQUALS  Y  Z))  (EQUALS  X  Z))) 
(X  Y  Z)  (IF  (AND  (AT  X  Y)  (AT  X  z) )  (EQUALS  Y  z))) 
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S  (NOT  (EQUALS  SRI  AIRPORT)) 

OK 

s  (not  (equals  airport  home)) 

OK 

0  (EX  (X)  (IF  (NOT  (AT  JOHN  AIRPORT) )  (AT  JOHN  X))) 

YES  WIEN  X  =  HOME 

OR  WIEN  X  SRI 

S  (IF  (AT  JOHN  AIRPORT)  (WITH  JOHN  DILL)) 

OK 

S  (FA  (X  Y  2)  (IF  (AND  (AT  X  Y)  (WITH  2  X))  (AT  2  Y))) 


Q  (EX  (X)  (IF  (AT  JOHN  AIRPORT)  (AT  DILL  X))) 

NO  PROOF  POUND 

S  (FA  (X  Y)  (IF  (WITH  X  Y)  (WITH  Y  X))) 

OK 

Q  (EX  (X)  (IF  (AT  JOHN  AIRPORT)  (AT  BILL  X))) 

\-ES  WHEN  X  =  AIRPORT 

Q  (EX  (X)  (IF  (NOT  (WITH  BILL  JOHN))  (AT  JOHN  X))) 
YES  WHEN  X  =  SRI 

OR  WHEN  X  =  AIRPORT 

OR  WHEN  X  =  HOME 

S  (AT  JOHN  SRI ) 

OK 

q  (NOT  (AT  JOHN  AIRPORT)) 

YES 

S  (FA  (X  Y)  (IFF  (DISJOINT  X  Y)  (FA  (U) 

(IF  (IN  U  X)  (NOT  (IN  U  Y)))))) 

OK 

Q  (FA  (X  Y)  (IF  (DISJOINT  X  Y)  (DISJOINT  Y  X))) 

YES 

S  (DISJOINT  BOY  GIRL) 

OK 

S  (IN  JOHN  BOY ) 

OK 
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Q  (NOT  (IN  JOHN  GIHL)) 
YES 

S  (IN  JUDY  GIRL) 

OK 


S  (FA  (X  Y  2)  (IF  (AND  (IN  X  Y)  (INCLUDE  Y  Z))  (IN  X  '/,))) 


S  (INCLUDE  DOY  PERSON) 

OK 

Q  (2X  (X)  (IN  X  PERSON)) 

YES  WHEN  X  =  JOHN 

S  (INCLUDE  GIRI,  PERSON) 

OK 

Q  (EX  (X)  (AND  (NOT  (IN  X  BOY))  (IN  X  PERSON))) 
YES  WHEN  X  =  JUDY 


UNWIND 


SUMMARY 

1  disjoint(boy,girl) 

2  INCLUDE (GIRL, PERSON) 

3  IN( JUDY, GIRL) 

4  IN(X.BOY)  -IN(X, PERSON) 

5  -INCLUDK(Y.Z)  ~in(x, y) 

in(x,z) 

6  IN (X, BOY)  ~IN(X, Y) 

- I NCLUDE ( Y , PERSON ) 

7  -I NCLUDE (GIRL, PERSON)  IN( JUDY, BOY) 

8  IN(JUDY.BOY) 

9  -DISJOINT(X,Y)  -IN(U,X) 

-IN(U,Y) 

10  -IN(JUDY,Y)  -DISJOINS  BOY ,Y) 

11  -IN( JUDY, GIRL) 

(CONTRADICTION  FROM  CJiAUSES  11  AND  3) 
(92  CLAUSES  GENERATED) 


AXIOM 
AXIOM 
AXIOM 
neg  of  him 

AXIOM 

FROM  4,5 
FROM  3,6 
FROM  2,7 

AXIOM 
FROM  »,!) 
FROM  1,10 


V  CONCLUSIONS  AND  SPECULATIONS 


A.  The  Prol)lojii_of  Data  Hoprosontu  1  Ion 

Suppose  wo  wish  to  store  in  u  computer  somo  items  of  Information, 
which  wo  shall  call  "the  data,"  for  use  by  a  <iuest  ion-answer  inn  system. 
This  data  may  have  as  its  source  English  text,  logical  rules,  pictures, 
etc.  Thu  problem  of  representing  this  data  may  bo  divided  into  throe 
parts: 

(1)  Determining  the  semantic  content  of  the  data.  For 
example,  we  may  decide  that  the  somantics  of  tho 
sentence,  "John  is  tho  father  of  Dill,"  is  exprossod 
by  the  binary  relation  "is-the- futhor-of "  applied 

to  the  objects  named  "John"  and  "Bill." 

(2)  Choosing  a  language  In  which  to  oxpross  this  sonumtic 
content.  For  example,  wo  may  use  tno  notation  of 
first  order  logic  and  pick  appropriate  symbols--!. c. , 

Father  (John,  Bill) 

(3)  Choosing  a  memory  organization— i.e. ,  a  way  to  repre¬ 
sent  statomonts  in  tho  computer  memory.  In  tho  LISP 
programming  system,  for  example,  statements  would  be 
stored  in  linked  list  structure,  possibly  using  LISP 
atomic  symbols  as  entry  points — e.g.,  on  tho  property- 
list  of  tho  atom  1  John  wo  could  place  the  value 
"Bill'1  under  the  attribute  "Father." 

In  expressing  the  semantic  content  of,  say,  a  sentence  of  English, 
we  are  deciding  what  information  that  sentence  can  provide  for  the 
question-answer Ing  system.  More  specifically,  we  are  restricting  the 
set  of  statements  that  may  be  deduced  from  the  representation  of  that 
sentence.  Thus  a  criterion  that  should  bo  used  in  specifying  semantic 
content  is:  Will  the  system  be  able  to  correctly  answer  questions  con¬ 
cerning  the  subject  matter  of  that  sentonco? 

Tha  language  mentioned  in  (2)  above  should  be  selected  to  represent, 
unambiguously  and  compactly,  the  semantic  content  of  the  data.  This 


language  should  also  ho  such  that  we  can  write  a  set  of  rules  (a  program) 
to  manipulate  the  facts  cxpi-ossed  in  this  language  in  order  to  produce 
tile  desired  answer. 

It  is  not  yet  clear  what  criteria  to  vise  in  selecting  the  memory 
organization  or  data  structure  l'or  a  given  language.  In  particular,  the 
representation  of  «  given  statement  could  he  either  one  or  tho  other  of 
the  following: 

(1)  In  such  a  form  that  tho  structure  of  the  statement 
itself  aid  in  information  retrieval. 

(2)  In  such  a  form  that  stored  items  may  bo  oasily  used 
in  the  deductive  routines— e.g. ,  as  clauses  for  a 
proof  procedure.  In  this  case  additional  structure 
must  bo  provided  to  aid  in  Information  retrieval. 

In  any  case,  it  soon  becomes  difficult  to  study  the  question  of  momory 
organization  apart  from  tho  question  of  problem-solving  or  deductive 
techniques . 

Our  work  thus  far  suggests  that  tho  fastest  progross  may  bo  obtained 
by  first  planning  problem- sol vine  techniques  and  deductive  routines,  then 
choosing  a  language  to  represent  tho  data  that  is  compatible  with  thoso 
techniques,  and,  finally,  selecting  a  memory  organization  compatible  with 
the  problem-solving  techniques  and  languages  chosen.  However,  since  the 
efficiency  of  memory  organization,  language  representation,  and  problem¬ 
solving  techniques  are  highly  interdependent,  we  expect  future  versions 
of  our  question- answering  systems  will  exhibit  changes  in  all  these  us- 
poevs . 

At  our  current  stage  of  work,  tho  greatest  advances  in  question- 
answering  ability  seem  to  depend  upon  improved  logical  problem-solving 
ability.  The  choice  of  internal  language  and  data  structures  has  not 
yet  been  critical  to  the  performance  of  the  systems. 

U.  Improvements  to  QA2 

At  the  time  ot  this  wi’iting,  QA2  is  an  operational  program  with  the 
ability  to  accept  and  store  facts  in  the  form  or  predicate  calculus 
statements,  and  to  answer  questions  by  (l)  selecting  relevant  facts  from 
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its  nunnery,  (a)  operating  a  theorem-proving  program  on  no  loo ted  facts, 
and  (3)  keeping  track  of  tho  bindings  of  vartubios  made  in  the  course  of 
a  proof.  Howevor,  the  moasuro  of  relevance  for  selecting  facts  is  very 
rudimentary;  the  theorem  prover  is  the  Ho  Mason  procedure  with  t»  few 
basic  search  heuristics;  and  the  tracking  of  variables  is  only  a  first 
stop  toward  constructing  answers  to  complex  questions.  In  each  of  those 
uroas,  sovorul  potential  modifications  are  apparent  that  would  result,  in 
improved  quest ion- answering  ability, 

1 .  Selecting  Kolovuut  Facts 

The  present  criterion  of  relevance  is,  "is  the  scloctod  fact  resolv¬ 
able,  by  the  theorom  prover,  with  any  clause  having  T" support V"  As  the 
data  base  grows,  more  stringent  relevancy  criteria  will  have  to  be  usod. 
One  new  relevancy  measure  we  plan  to  use  is  tho  number  of  constants  and 
predicate  symbols  in  oommon  between  tho  fact  in  memory  and  the  clause 
with  which  it  resolves.  Of  course,  tho  efficient  use  of  such  conditions 
implies  that  facts  in  momory  must  bo  indexed  or  sorted  in  appropriate  ways 

2,  Improving  tho  Theorem  Prover 

Sovorul  modifications  to  tho  theorem  prover  would  result  in  signif¬ 
icant  reductions  in  running  time  with  no  doorcase  in  effectiveness. 

Theoretical  work  by  Hart*’*  Koblnson , 31  and  Slagle,  ,p  indicates  that 
many  unnecessary  stops  are  performed  by  a  basic  theorem  prover  such  as 
the  one  we  have  implemented.  Variations  of  techniques  to  avoid  the  extra 
steps  have  been  called  "Elimination  of  equivalent  proofs,"  "Finding 
maximal  clnshos,-1  and  "Making  a  theorem  prover  singly  connoctod."  Wo 
feel  that  the  increased  efficiency  resulting  from  implementing  one  of 
thoso  procedures  will  probably  Justify  tho  increased  overhead  necessary 
for  bookkeeping. 

Another  major  increase  in  theorem- proving  efficiency  would  result 
from  incorporating  into  the  theorem  prover  some  special  knowledge  about 
certain  relations.  If  equality  is  defined  by  three  axioms  in  the  system 
to  be  an  equivalence  relation,  then  every  proof  involving  equality  will 
make  many  superfluous  references  to  those  axioms  merely  because  of  their 


extreme  generality.  These  extraneous  proof  steps  would  be  avoided  if 
special  information  about  the  significance  of  equality  were  built  right 
into  the  theorem-proving  program — e.g,,  if  we  are  given  that  f(b)  =  f(a), 
then  P(a,  x,  f(x))  should  resolve  directly  with~P(a,  b,  f(a)),  without 
intermediate  use  of  equality  axioms.  Similarly  set- inclusion  and  set- 
membership  relations  occupy  distinguished  roles  in  most  logical  deduc¬ 
tions  and  thus  should  be  given  special  treatment. 

3 ,  Increasing  the  Range  of  Question  Types 

Currently  QA2  can  answer  two  types  of  questions:  "is  S  true?"  is 
answered  by  presenting  the  predicate  calculus  sentence  S  to  the  theorem- 
prover;  "Find  an  x  satisfying  P(x)"  is  answered  by  presenting  the  formula 
(3x)  P(x)  to  the  theorem  prover  and  tracking  the  values  it  assigns  to  x. 

We  plan  to  study  how  the  system  can  be  extended  to  handle  additional 
question  types.  In  particular,  many  axiomatizations  of  interesting 
problems,  such  as  problems  whose  solutions  consist  of  a  sequence  of  steps 
or  actions  require  second-order  predicate  calculus.  We  believe  we  can 
extend  the  resolution  technique  to  handle  certain  forms  of  second-order 
theorems . 

C .  Further  Plans 

We  have  discussed  above  two  experimental  question-answering  systems: 
QA1,  on  which  work  has  stopped,  and  QA2,  which  is  operational  but  for  which 
various  improvements  are  now  in  progress.  Our  work  on  these  systems  has 
suggested  several  major  innovations  that  we  expect  to  incor’  orate  into 
future  question-answering  systems. 

1 .  Extensions  of  the  Logical  Notation 

Various  notational  devices,  some  of  which  can  be  defined  as  abbre¬ 
viations  in  conventicnal  logical  systems,  could  have  a  special  significance 
in  a  question- answering  system.  For  example,  consider,  in  addition  to 
the  usual  universal  and  existential  quantifiers  (wx)  and  (3x),  the  fol¬ 
lowing  special  quantif ier-type  operators: 
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'A--'  ; 


There  exists  a  unique  element  (a.'x) 

All  x  in  the  set  y  (Vxey)  epsilon 

/_  .  '  quantifiers 

There  exists  an  x  in  set  y  (3xeyJ 

Any  object  x  (ST  x) 

The  set  of  all  x  (STA  x) 

’•  unique  object  x  (STi  x)  descriptive 

operator . 

The  descriptive  operator  is  discussed  by  Kalish  and  Montague.20 
The  "st"  (such  that)  operator  was  useful  in  the  QA1  representation  and 
query  language. 

Epsilon  quantifiers  can  be  helpful,  not  only  in  expressing  facts  and 
questions  more  naturally,  but  also  in  directing  search  procedures.  Such 
use  of  quantifiers  that  reduce  the  size  of  the  search  space  can  be  even 
more  effective  if  the  theorem  prover  has  special  built-in  knowledge  of 
the  properties  of  the  relevant  relations. 

2 .  Enhancement  of  the  Man-Machine  Interface 

The  operation  of  the  system  should  be  made  more  convenient  and  more 
transparent  to  the  human  user.  The  interface  language  should  eventually 
be  made  more  English-like.  (The  QA2  interface  language  is  already  slightly 
more  natural  than  standard  logical  prefix  notation.)  In  addition,  the 
system  should  be  made  more  interactive.  For  example,  the  system  should 
be  able  to  report  why  or  how  it  gets  into  difficulty  on  a  proof,  and  re¬ 
quest  assistance  when  necessary. 

3 .  Use  of  New  Planning  and  Search  Heuristics 

Slagle  19  reports  a  technique  for  finding  clashes,  and  thus  eliminating 
equivalent  proofs,  by  making  use  of  semantic  models  for  the  logical  system. 
A  generalization  of  this  technique  might  permit  the  theorem  prover  to  use 
facts  in  the  question-answerer's  memory  as  a  guide  to  determining  which 
steps  to  try  next.  Extensive  work  remains  to  be  done  on  how  a  theorem- 
prover  can  best  utilize  semantic  information. 
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Better  planning  heuristics  could  significantly  reduce  the  time  used 
to  produce  a  proof  (and  thus  answer  a  question).  The  choice  and  proof 
of  appropriate  lemmas  as  "stepping  stones,"  along  with  an  extended  set 
of  support  strategy,  could  provide  a  sense  of  direction  that  is  missing 
from  most  current  theorem- pro ving  programs. 
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Appendix  A 
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OPERATION  OP  QA1 


1.  Data  File 


All  the  data  contained  in  memory  may  be  shown  by  listing  the  file 
"Dictionary."  This  computer  output  is  shown  below.  The  first  word  in 
each  list  is  a  LISP  atomic  symbol  and  the  rest  of  the  list  is  the  prop¬ 
erty  list  of  that  symbol. 

LIST  DICTIONARY 

LISTING  OP  PILE  DICTIONARY 

(TEU  TYPE  PROPNAME  INST  (TEL1  TELEPHONE)  ONTOP  (TEL1  TABLE1 ) ) 

(TYPE  TYPE  RELATION  ARCS  2  EQUIVPORM  ((PL  EQUIVFORM  RULE  (FREEVARS 
(XY)  PRED  (EQUIV  (PL  X  TYPE  Y)  (TYPE  XY) ) ) ) )  ) 

(MIKE  TYPE  PROPNAME  INST  (MIKE  PERSON)) 

(TABLE1  TYPE  PROPNAME  INST  (TABLE1  TABLE)  ONTOP  (TEU  TABLE1 ) ) 

(GIRL  TYPE  GENPHYS  INCLUDE  (GIRL  PERSON)) 

(PLUSS  TYPE  FUNCTION) 

(IIASPART  TYPE  RELATION  ARCS  2  INPUT  ( (STF  (X  Y) 

TYPE  (X  OBJECT)  TYPE  (Y  OBJECT'));) 

(SAM  TYPE  PROPNAME) 

(HASPARTSET  TYPE  RELATION  DEFINITION  ((PL  DEFINITION  INTERMSOF 
(HASPART  CARDINALITY) 

RULE  (FREEVARS  (X  Y  Z  N) 

PREDICATE  (EQUIV  (HASPARTSET  X  (Y  »)) 

( (XIRTSU  Z  INST  (Z  Y) ) 

(FORALL  IV  INST  (W  Z)) 

(AND  (HASPART  X  VI)  (CARD  Y  N) )))))) 

COMPUTES  (HASPARTSET  (ST  X  RULE  (FREEVARS  (X  V  Z  M  N) 

PRED  (IMPLIESS  (ANDD  (HASPARTSET  X  (L  Y  M)) 

(HASPARTSET  Y  (L  Z  N) ) ) 

(HASPARTSET  X  (L  Z  (EV  TIMESS  M  N)))))))) 
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(AIRFIELD  TYPE  GENPUYS  INST  ((XLIST  SOOTTT  CLARK  DOVER)  AIRFIELD) 
HASPART  (AIRFIELD  (A  RUNWAY))) 

(FINGER  TYPE  GENPUYS  HASPARTSET  (RAND  (L  FINGER  5))) 

(USE  TYPE  RELATION  ARCS  2) 

(PERSON  TYPE  GENPUYS  INCLUDE  (BOY  PERSON)  HASPARTSET  (PERSON  (L  ARM  2))) 

(BOY  TYPE  GENPUYS  INST  (JOHN  BOY)  INCLUDE  (BOY  PERSON)) 

(ARM  TYPE  GENPUYS  HASPARTSET  (ARM  (I.  HAND1 ) )  HASPARTSET  (PERSON 
(L  ARM  2))) 

(RUNWAY  TYPE  GENPUYS  HASPART  (AIRFIELD  (A  RUNWAY))) 

(BOOK1  TYPE  PHOPNAME  INST  (I300K1  UOOK) 

COLOR  (BOOK1  (ST  X  INST  (X  MIXEDOOLOR(  PART  (X  RED)  PART  (X  BLUE)))) 

(TIMESS  TYPE  FUNCTION  ARGS  INDEF  OOMPUTESL  TIMES) 

(HAND  TYPE  GENPUYS  HASPARTSET  (HAND  (L  FINGER  5)) 

HASPARTSET  (ARM  (L  RANI)  I))) 

(SOOTT  TYPE  PROPNAME  INST  (SOOTT  AIRFIELD) 

HASPART  (SOOTT  (ST  X  INST  (X  RUNWAY) 

SURFACE  ASPHALT  LENGTH  (OOOO  FEET)))) 

(JOHN  TYPE  PROPNAME  INST  (JOHN  BOY)) 

(INST  TYPE  RELATION  COMPUTES  (INST  (ST  X  RULE  (FREEVARS  (U  V  W) 

PRED  ( IMPLTKSS  ( ANDD  ( INCLUDE  V  W)  (INST  U  V) )  (INST  H  »))>))) 

(TELEPHONE  TYPE  GENPUYS  INST  (TELl  TELEPHONE) 

USE  (TELEPHONE.  COMMUNICATION)  USE  PAPERWEIGHT 

END  OF  FILE  DICTIONARY 

2.  Results  with  QA1 

With  the  data  base  given  in  the  above  "Dictionary,"  the  following 
question  and  answers  session  was  conducted.  The  first  line  in  each 
group  is  input,  followed  by  the  system's  response. 
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FINDR(  ONTOP(  TEU  TABLE  1  / ) 

(ONTOP  (TEU  TABLE1 ) ) 

FINDR(  ONTOP (  XXA  TABLE1 ) ) 

(ONTOP  (TEU  TABLE1 ) ) 

PI NDR (ONTOP (  TEL1  XXA)) 

(ONTOP  (TEU  TABLE1)) 

PINDK(ONTOP  (TEU  TABLE 2 ) ) 

NIL 

FINDR(  ONTOP (  TEL2  TABLE1 ) ) 

NIL 

PINDR( INST  (  XXA  TELEPHONE)) 

(INST  (TEU  TELEPHONE)) 

F1NI»R(  TYPE(  TEU  XXA)) 

(TYPE  PROPNAME) 

FINDR(  INST(  MIKE  XXA)) 

(INST  (MIKE  PERSON)) 

FINDR(  INCLUDE(  BOY  XXA)) 

(INCLUDE  (BOY  PERSON)) 

FINDR(  TYPE  (PLUSS  XXA)) 

(TYPE  FUNCTION) 

FINDR(  TYPE(  SAM  XXA)) 

(TYPE  PROPNAME) 

FINDR(  DEFINITION (  liASPARTSET  DCE ) ) ) 

(DEFINITION  ((PL  DEFINITION  INTERM SOF  (HASPART  CARDINALITY) 

RULE  (FREEVARS  (X  Y  7,  N) 

PREDICATE  (EQUIV  (HASPARTSET  X  (YN)) 

( (XISTSU  Z  INST  (Z  Y)) 

(IORALL  W  INST  (W  Z))  (AND  (HASPART  X  W)  (CARD  Y  N)))))))) 

FJNDR(  INST  (CLARK  AIRFIELD)) 

(INST  ( (XLIST  SOOTTT  CLARK  DOVER)  AIRFIELD)) 

FINDR( HASPART  (AIRFIELD  (A  RUNWAY))) 

(HASPART  (AIRFIELD  (A  RUNWAY))) 

F I NDR ( HASPARTSET  ( HAND  (L  FINGER  5))) 

(HASPARTSET  (HAND  (L  FINGER  5))) 
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FINi>R(  HASPARTSET(HAND(L  DOE  6))) 

(IIASPARTSET  (HAND  (L  FINGER  5))) 

findu(  iiaspartset  (dce(l  FINGER  6))) 

(IIASPARTSET  (HAND  (L  FINGER  0))) 

F I NDR ( IIASPA RTSET ( HAND ( L  FINGER  DCE  ) )  ) 

(IIASPARTSET  (HAND  (L  FINGER  5))) 

FI  NDR  (  IIASPARTSET  (  AKM(  L  HAND  DCS))) 

(IIASPARTSET  (ARM  (L  HAND  l))) 

findr(haspartset(  arm(l  FINGER  DCE)) 

NIL 

wNotc  that  FI NDR  cannot  solve  this,  whereas  VFIND  con  (soe  below). 

FINDR(  HASPART  (SCOTT  DCE))) 

(IIASPART  (SCOTT  (ST  X  INST  (X  RUNWAY) 

SURFACE  ASPHALT  LENGTH  (COOO  FEET)))) 

FINDU(HASPART  (SCOTT  (A  RUNWAY))) 

(HASPART  (SCOTT  (ST  X  INST  (X  RUNWAY) 

SURFACE  ASPHALT  LENGTH  (GOOD  FEET)))) 

FINDR(USE  TELEPHONE  XXA  ) ) 

(USE  (TELEPHONE  COMMUNICATION)) 

FI NDR (USE (TELEPHONE  PAPERWEIGHT) ) 

(USE  PAPERWEIGHT) 

FINDR(  ONTOP  ((ST  X  USE(X  COMMUNICATION))  TAULE1 ) ) 

(ONTOP  (TELl  TABLE  1 ) ) 

FI  NDR  ( ONTOP  ( (  ST  X  USE  (X  PAPERWEIGHT)  )  TABLED) 

(ONTOP  (TELl  TABIjEI  ) ) 

FI  NDR  (ONTOP  ((ST  X  USE  PAPERWEIGHT)  TABI.E1  ) ) 

(ONTOP  (TELl  TAULE1)) 

FT  NDR (ONTOP  (TELl ( ST  X  INST(X  TABLE)))) 

(ONTOP  (TELl  TABLED) 

F1NDR(  HASPART(  AIRFIELD  (ARUNWAY))) 

(IIASPART  (AIRFIELD  (A  RUNWAYD) 

FI NDR ( HASPART (  EACH  AIRFIELD) (A  RUNWAY ) ) ) 

(IIASPART  (AIRFIELD  (A  RUNWAY))) 

FINDR (IIASPART  (  (  ONE  AIRFIELD)  (A  RUNWAY))) 

NIL 
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F1NDR(  HASPART(  AIRFIELD  (ST  X  INST  (X  RUNWAY)))) 

(HASPART  (AIRFIELD  (A  RUNWAY))) 

FINDR(HASPAhT(SOOTT  (A  RUNWAY))) 

(HASPART  (SCOTT  (ST  X  INST  (X  RUNWAY) 

SURFACE  ASPHALT  LENGTH  (6000  FEET)))) 

findk(haspart  (soon-  (ST  X  inst(x  RUNWAY)  SURFACE  ASPHALT))) 
(HASPART  (SCOTT  (ST  X  INST  (X  RUNWAY) 

SURFACE  ASPHALT  LENGTH  (6000  FEET)))) 

F INDR (HASPART (SCOTT  (ST  X  LENGTH ( 6000  FEET)))) 

(HASPART  (SCOTT  (ST  X  INST  (X  RUNWAY) 

SURFACE  ASPHALT  LENGTH  (6000  FEET)))) 

FINDR(HASPART  (SCOTT  (ST  X  LENGTH  (5280  FEET)))) 

NIL 

FINDR(HASPART  (SCOTT  (ST  Z  LENGTH (6000  FEET)  SURFACE  ASPHALT)))) 
(HASPART  (SCOTT  (ST  X  INST  (X  RUNWAY) 

SURFACE  ASPHALT  LENGTH  (6000  FEET)))) 

VFINI)(  (X)  (ONTOP  TEL1  '/,)) 

((Z  .  TABLE1 ) ) 

VFIND(  (Q)  (ONTOP  Q  TAGLE1 ) ) 

( (Q  .  TELl ) ) 

VFINI>(  (X)  ( INST  X  TELEPHONE ) ) 

((X  .  TEU)) 

VFINI)(  (X)  ((INST  X  TEIJ3PJIONK )  (ONTOP  X  TABLE1  ) ) ) 

(FAILFIND  (ONTOP  X  TADLE1 )  ((X  .  TEU))) 

vfind( (x) (haspartset  ahm(l  finger  X))) 

((x  .  5)) 

VFIND((X) (HASPARTSET  ARM(I.  FINGER  X))) 

((X  .  5)) 

VFIND(  (X) (HASPARTSET  HUMAN  (L  FINGER  X))) 

((X  .  10)) 

VFIND(  (X)  (INST  JOHN  X)) 

((X  .  HOY)) 

VFIND(  (X)  (INST  X  PERSON)) 

((X  .  JOHN)) 
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3,  Internal  Quern lion 


In  normal  program  operation,  only  the  system  responao  to  a  question 
is  printed  out.  By  using  the  LISP  function  "TRACE,"  sonto  of  the  internal 
operations  nro  shown.  The  function  is  applied  to  VFAl  and  gives  its 
arguments  and  values  during  the  course  of  answering  n  question.  The 
question  asked  below  wns  discussed  in  Soc.  1I-D,  tDNC  moons  "don't 
enre,"  UNK  means  "UNKNOWN,"  and  ALG  means  "algorithm"). 

VFINU(  (X)  (INST  X  PERSON)  ) 

ARGS  OF  VFAl 
(INST  X  PERSON) 

((X  .  UNK)) 

ARGS  OF  VKA1 
(COMPUTES  INST  ALG) 

((ALO  .  UNK)) 

VALUE  OF  VKAi 

((ALG  ST  X  RULE  (FREEVARS  (U  V  W) 

PRED  (IMPLIBSS  (ANUD  ( INCLUDE  V  W)  (INST  U  V))  (INSTU  W))))) 

ARGS  OF  VFAl 
(INCLUDE  V  PERSON) 

((V  .  DNC)  (U  .  UNK)) 

VALUE  OF  VFA1 
((V  .  BOY)) 

ARGS  OF  VFA1 
(INST  U  V) 

((V  .  BOY)  (U  .  UNK)) 

VALUE  OF  VFAl 
((If  .  JOHN)) 

VALUE  OF  VFAl 
((X  .  JOHN)) 

((X  .  JOHN)) 

VFIND(  (X)  (INST  JOHN  X)) 

ARGS  OF  VFAl 
(INST  JOHN  X) 

((X  .  UNK)) 

VALUE  OF  WAl 
((X  .  BOY)) 

((X  .  BOY)) 
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Appendix  13 

aihorithms  kou  qA2 

Complete  listings  for  prog  mm  QA2  are  available  from  tho  authors 
upon  request .  Here  wo  inoludo  descriptions  of  some  of  the  koy  algorithms 
that  aro  embodied  in  tho  QA2  theorem- proving  program.  The  Prunox  Algo- 
rithm  translates  input  sta tomcats  and  questions  into  tho  standard  form 
for  tho  thcovom- proven .  Tho  Subsumption  Algorithm  is  used  whenever 
possible  to  eliminate  newly  generated  clauses.  Tho  Unification  Algo- 
ritlun  is  the  match  proooduro  at  tho  heart  of  the  resolution  method. 


* 

I 


1.  The  Pronex  Algoritlun 

The  function  (PUENEX  E)  produces  the  pronex.  conjunctive  normal 
form  for  the  first-order  predicate  calculus  statement  E,  The  form  of  E 
may  be  one  of  the  following: 


(1)  Literal:  E 

(2) 


and  nl, 
of  P. 


i  (P  tl  ...  an)  where  P  is  a  predicate  symbol, 
an  represent  tonus  which  ure  tho  arguments 


en) 


Conjunction:  E  =  (AND  el  ...  on)  where  the  el  can  again 
bo  any  well  formed  statements.  The  expression  (AND  oi  ., 
represents  the  logical  statement  el  A  e2  A  . , .  A  en. 

(3)  Disjunction:  E  =  (OR  el  ...  en)  which  represents  the 
statement  ol  y  o2  \  . . .  v  en. 

(*l)  Implication:  E  »  (IMP  el  o2)  or  E  =  (IF  ol  e2)  which 
represent  the  statement  el  =  >  c2. 

(5)  Equivalence;  E  s  (EQV  cl  c2)  or  K  =  (IFF  el  o2)  which 
represent  the  statement  el  <=>  ell. 

(6)  Negation:  E  =  (NOT  e)  representing  the  statement  ~e. 

(7)  Universal  Quantification:  E  =  (FA  (xl  x2  ...  xn)  e) 

whoro  xl,  x2,  ....  xn  are  any  variables.  (FA  (xl  ...  xn)  o) 
represents  the  logical  statement  (y  xl)  (-  x2)  ...  (y  xn)o. 

(D)  Existential  Quantification;  E  =  (EX  (xl  x2  ...  xn)  c) 
represents  the  statement  f St  xl)  (3x2)  ...  (a  xn)e. 
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Any  symbol  which  occurs  in  a  literal  and  which  has  not  been  quanti¬ 
fied  is  assumed  to  denote  a  constant— e ,g . ,  in  the  statement; 

(FA  (X)  (IF  (IN  X  BOY) (IN  X  PERSON))) 

BOY  and  PERSON  are  treated  as  constants. 

The  prenex  con jun<  tive  normal  form  of  a  statement  E  is  a  list  of 
clauses  (Cl  ...  Cn)  where  each  Ci  is  a  list  of  literals  (LI  ...  Lm). 

First  PRENEX  calls  PRENEX1  with  the  argument  E.  PRENEX1  performs 
the  following  tasks: 

(1)  If  E  =  (NOT  e)  then  ( PRENEX 1  e)  is  called.  A  flip-flop 
switch  is  set  to  indicate  that  e  is  negated.  This  has 
the  effect  of  taking  negation  symbols  inside  the  ex¬ 
pression. 

(2)  If  E  =  (AND  el  ...  en)  or  E  =  (OR  el  . . .  en)  then 
(PRENEX1  ei)  is  called  for  each  i,  i  -  1,  ....  n. 

The  resulting  clause  lists  are  combined  into  a  single 
OR  or  AND  list  of  the  results.  The  negation  flip- 
flop  is  used  to  avoid  multiple  negations. 

(3)  If  E  =  (IF  el  e2),  (IMP  el  e2),  (IFF  el  e2),  or 

( EQV  el  e2)  tlnn  the  connectives  are  transformed 
into  connectives  using  AND,  OR,  and  NOT,  and  PRENEX! 
of  the  transformed  expression  is  called — e.g., 

(IF  el  e2)  is  transformed  into  (OR  (NOT  el)  e2). 

(4)  If  E  is  a  quantified  expression: 

(a)  If  the  essential  quantifier  is  universal 
(FA  if  the  negation  flip-flop  is  not  set 
and  EX  if  the  switch  is  set)  the  variables 
xl,  ...  xn  are  added  to  a  free  variable 
list  and  (PRENEXI  e)  is  culled, 

(b)  if  the  essential  quantifier  is  existential, 
each  of  the  variables  xl,  ....  xn  is 
bound  to  a  generated  Skolem  function  whose 
arguments  are  all  of  the  variables  on  the 
free  variable  list  at  this  point,  and 

( PRENEXI  e)  is  called. 

(5)  If  E  =  (P  al  ...  an)  is  an  atomic  formula,  then: 

(a)  All  of  the  variables  in  E  which  occur  on  the 
list  of  bindings  arc  replaced  by  their  cor¬ 
responding  Skolem  functions. 
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(b)  The  expression  (((P  al  ...  an)))  or 

((NOT  (?  al  ...  an)))  is  returned  as  the 
value  of  PIUSNEX1  of  E  (according  to  the 
flip-flop),  and  is  the  conjunctive 
normal  form  of  E  =  (P  al  ...  an). 

When  PilENEXl  exits  with  the  list  of  clauses  corresponding  to  the 
original  statement  E,  PKENEX  performs  a  tautology  and  subsumption  check 
on  the  clauses  so  as  to  return  a  list  of  non- tau to logic,  inequivalent 
clauses,  each  of  which  is  of  minimal  length. 

2 .  The  Subsumption  Algorithm 

The  function  (SUBSUME  Cl  C2 )  determines  whether  or  not  the  clause 
Cl  subsumes  the  clause  C2.  That  is,  it  sees  whether  or  not  there  is  a 
substitution  9  such  that 


(Cl) g  C  C2 

The  following  algorithm  is  used: 

Let  Al  =  (LI  ...  Ln),  Bl  =  (Ml  ...  Mm)  be  the  literals  of  Cl  and  C2 
respectively.  Two  association  lists  ALl  and  AI,  are  used  for  keeping 
track  of  the  various  substitutions  computed  '•’•ving  the  subsumption  test. 
Initially  ALl  and  AL  are  both  set  to  NIL. 

The  function  (SUBSM1  Al  Bl)  is  called  to  see  if  the  current  literal 
list  Aj.  subsumes  the  list  Bl  with  respect  to  the  substitution  on  AL. 

(1)  If  Al  is  ever  NIL  the  test  is  successful  and  SUBSM1 
returns  true. 

(2)  If  Bl  is  ever  NIL  (Al  /  NIL)  then  SUBSM1  returns  false. 

(3)  If  neither  Al  nor  Bl  is  NIL,  the  SUBSM2  is  called  to 
see  if  the  first  literal  of  Al  subsumes  the  first 
literal  of  Bl,  whet ,  ALl  is  first  set  to  AL.  1,  the 
test  is  successful  then  the  substitution  list  ALl 

udifieii  to  include  the  match  and  then  a  test  is 
m;  ,•  (recursively  1  with  SUBSM1  called  with  Al  =  (L2  ...  Ln) , 

Bl  =  (ill  ...  Mm),  and  AI.  =  AI.l.  If  this  is  successful, 
then  SUBSM1  returns  truth.  If  this  test  is  not  success¬ 
ful,  or  if  LI  did  not  subsume  Ml,  then  SUBSM1  is  called 
with  Al  =  (LI  ...  I.11),  Bl  -  (M2  ...  Mm),  and  AI,  restored 
tc  its  previous  value. 
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The  algorithm  has  the  effect  of  first  trying  to  map  LI  into  C2, 
then  (L2)9^  into  C2  (where  is  the  substitution  that  was  needed  to 
reduce  Ll  to  a  literal  of  C2)  and  so  on.  If  at  any  point  (Li)9i_1  cannot 
be  mapped  into  C.'i,  the  process  backs  up  one  step  and  retries  on 
It  terminates  when  all  of  the  Li  are  mapped  into  C2  or  when  Ll  has  been 
tried  on  all  of  the  literals  of  C2. 

3.  The  Unification  Algorithm 

The  function  (RESOLVE  1,1  L2  Cl  C2)  produces  the  resolvent  of  clauses 
Cl  and  02  on  literals  Ll  of  Cl  and  L2  of  C2. 

For  a  resolvent  to  exist  it  is  necessary  that  Ll  and  L2  be  of  the 
form  of  complementary  literals — e.g., 

Ll  =  (P  al  ...  an),  L2  ■=  (NOT  (P  bl  ...  bn)) 


and  that 

(P  al  ...  an),  (P  bl  ...  bn)  bo  unifiable. 

The  function  (MATCH  A  B)  determines  whether  or  not  the  lists  of 
arguments  A  =  (al  ...  an),  B  =  (bl  ...  bn)  are  unifiable,  and  if  so, 
produces  the  most  general  unifier  9. 

Let  AL1  and  Al,2  be  two  association  lists,  each  initially  empty, 
which  will  hold  the  bindings  of  variables  on  A  and  B,  respectively. 
The  function  (MATCH  A  B)  performs  the  following  operations: 

(1)  The  function  VAL  applied  to  Al  and  ALl  causes  (a)  any 
variables  in  Al  not  already  on  AL1  to  be  added  to 
AL1  and  bound  to  distinct  new  variables,  and  (b)  the 
term  al'  to  be  generated  by  replacing  each  variable 
in  al  by  its  value  from  the  AL1  list.  VAL  is  then 
applied  to  bl  and  AL2 ,  except  that  new  variables  in 
bl  are  initially  bound  to  themselves. 

(2)  The  function  MATCII1  is  then  applied  to  al  '  and  bl', 
as  follows.  If  al'  is  a  variable  and  bl'  is  not, 
and  al'  does  not  occur  in  bl',  then  bl'  is  substi¬ 
tuted  for  every  occurrence  of  al'  in  the  value  of  every 
variable  on  ALl  and  AL2,  If  bl '  is  a  variable,  then 
al'  is  substituted  for  bl'  in  the  values  on  ALl  and 
AL2 .  Otherwise  MATCH1  reports  failure. 
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If  both  al  '  and  bl'  are  functional  terms,  such  as 
al'  =  (f  si  ...  sr)(  bl '  =  (g  tl  ...  tk) ,  then  we 
must  have  f  -  g  and  the  function  MATC1I1  must  suc¬ 
ceed,  recursively,  with  each  pair  of  corresponding 
arguments  from  (si  ...  sr)  and  (tl  ...  tk).  Other¬ 
wise  MATC1I1  returns  NIL. 

(3)  If  MATCH1  succeeds  in  al'  and  bl ' ,  then  MATCH  is 
called  on  the  lists  (a2  ...  an),  (b2  ...  bn).  If 
both  lists  are  empty,  the  unification  was  suc¬ 
cessful  and  the  substitutions  on  AL1  and  AL2  repre¬ 
sent  the  most  general  unifier  8. 
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